Skip to content

Add Qwen3-Next tests to MaxText_MoE DAG#1194

Open
Rohan-Bierneni wants to merge 1 commit intomasterfrom
rbierneni-qwen3-next-tests
Open

Add Qwen3-Next tests to MaxText_MoE DAG#1194
Rohan-Bierneni wants to merge 1 commit intomasterfrom
rbierneni-qwen3-next-tests

Conversation

@Rohan-Bierneni
Copy link
Collaborator

@Rohan-Bierneni Rohan-Bierneni commented Feb 20, 2026

Description

Onboard qwen3-next tests to XLML DAG.

The script that the DAG will run tests:

  • Forward Pass Logit Check
  • Synthetic Data Train Workload
  • Simple Fine Tuning Train Workload
  • Decoding Workload

Tests

Made a dummy DAG to run the tests in local XLML. Will remove once verified that tests pass
The output of the DAG:

Note: Caching is added to the model in this pr: AI-Hypercomputer/maxtext#2971. It has not merged into main and thus decoding test will fail in this DAG. However, once that pr merges the command will work.

Checklist

Before submitting this PR, please make sure (put X in square brackets):

  • I have performed a self-review of my code.
  • I have necessary comments in my code, particularly in hard-to-understand areas.
  • I have run one-shot tests and provided workload links above if applicable.
  • I have made or will make corresponding changes to the doc if needed.

Copy link
Collaborator

@RissyRan RissyRan left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM! Once the test is green, I will click the approval. Thanks!

"cluster": XpkClusters.TPU_V5P_8_CLUSTER,
"time_out_in_min": 90,
},
"qwen3-next-80b": {
Copy link
Collaborator

@shuningjin shuningjin Feb 20, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can you add yourself as the owner for qwen3-next-80b?

We can do something like the maxtext_end_to_end DAG, where the owner is different across tests:

"gemma-2b": {
"owner": test_owner.MOHIT_K,
"commands": ["bash tests/end_to_end/tpu/gemma/2b/test_gemma.sh"],
},
"gemma2-2b": {
"owner": test_owner.HENGTAO_G,
"commands": [
"bash tests/end_to_end/tpu/gemma2/2b/test_gemma2_to_mt.sh",
"bash tests/end_to_end/tpu/gemma2/2b/test_gemma2_to_hf.sh",
],
},

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Sure, I have added myself for the qwen3 next test. For the other tests, i let it default to you.

time_out_in_min=90,
test_name="maxtext_qwen3_next_80b_test",
run_model_cmds=(
f"export HF_TOKEN={HF_TOKEN}; export BASE_OUTPUT_PATH=$GCS_OUTPUT; bash tests/end_to_end/tpu/qwen/next/qwen3-next-80b-a3b/2_test_qwen3_next_80b_a3b.sh",
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

To confirm this test is only checking for runtime errors ? do you also plan on adding a tests for logits now or in the future ?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The test is running this script: https://github.com/AI-Hypercomputer/maxtext/blob/040c71b73616d768b141da07292fb0417164846c/tests/end_to_end/tpu/qwen/next/qwen3-next-80b-a3b/2_test_qwen3_next_80b_a3b.sh

It does:

  • Forward pass logit check
  • train workload
  • finetuning workload
  • decoding workload

It should cover logit comparision, runtime errors, config checks, train/decoding support, etc. Pretty much all end-to-end model checks.

Copy link
Collaborator

@parambole parambole left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM. I have left a few comments. PTAL

Add qwen3-next to MaxText_moe DAG

Run pylinter

add separate test owner logic
@Rohan-Bierneni Rohan-Bierneni force-pushed the rbierneni-qwen3-next-tests branch from acaac29 to e029f69 Compare February 20, 2026 23:48
@Rohan-Bierneni Rohan-Bierneni self-assigned this Feb 20, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants